Eigenvector dynamics: theory and some applications 
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We propose a general framework to study the stability of the subspace spanned by P consecutive 
eigenvectors of a generic symmetric matrix Ho, when a small perturbation is added. This problem 
is relevant in various contexts, including quantum dissipation (Ho is then the Hamiltonian) and risk 
control (in which case Hq is the assets return correlation matrix). We specialize our results for the 
case of a Gaussian Orthogonal Hq, or when Hq is a correlation matrix. We illustrate the usefulness 
of our framework using financial data. 



Random Matrix Theory (RMT) is extraordinary pow- 
erful at describing the eigenvalues statistics of large ran- 
dom, or pseudo-random, matrices Eigenvalue den- 
sities, two-point correlation functions, level spacing dis- 
tributions, etc. can be characterized with exquisite de- 
tails. The "dynamics" of these eigenvalues, i.e. the way 
these eigenvalues evolve when the initial matrix Hq is 
perturbed by some small matrix eP, is also well under- 
stood [5]. The knowledge of the corresponding eigenvec- 
tors is comparatively much poorer. One reason is that 
many RMT results concern rotationally invariant matrix 
ensembles, such that by definition the statistics of eigen- 
vectors is featureless. Still, as we will show below, some 
interesting results can be derived for the dynamics of 
these eigenvectors. Let us give two examples for which 
this question is highly relevant. 

One problem where the evolution of eigenvectors is im- 
portant is quantum dissipation As the parameters 
of the Hamiltonian Ht = Hq -I- ePt of a system evolve 
with time t, the average energy changes as well. One 
term corresponds to the average (reversible) change of 
the Hamiltonian that leads to a shift of the energy lev- 
els (the eigenvalues) . But if the external perturbation is 
not infinitely slow, some transitions between energy lev- 
els will take place, leading to a dissipative (irreversible) 
term in the evolution equation of the average energy of 
the system. The adiabaticity condition which ensures 
that no transition takes place amounts to comparing the 
speed of change of the perturbation ePt with a quantity 
proportional to the typical spacing between energy levels. 
For systems involving a very large number N of degrees 
of freedom, the average level spacing of the NxN Hamil- 
tonian H goes to zero as iV~^. For — > oo, any finite 
speed of change therefore corresponds to the "fast" limit, 
where a large number of transitions between states is ex- 
pected. In fact, if the quantum system is in state |(/)^) 
at time t = Q, that corresponds to the ith eigenvector 
of Hq, the probability to jump to the jth eigenvector of 
Hi, \4>j), at time i = 1 is given by |(</>] |(/)°) where we 
use the bra-ket notation for vectors and scalar products. 
The way energy is absorbed by the system will there- 
fore be determined by the perturbation-induced distor- 
tion of the eigenvectors. More precisely, if l^i^) is different 



from \<pl), some transitions must take place in the non- 
adiabatic limit, that involve all the states j which have a 
significant overlap with the initial state. 

Another very relevant situation is quantitative finance, 
where the covariance matrix C between the returns of 
N assets (for example stocks) plays a major role in risk 
control and portfolio construction [5]. More precisely, 
the risk of a portfolio that invests Wa in asset a is given 
ap WaCapwp. Constructing low risk portfo- 
lios requires the knowledge of the n largest eigenvalues 
of C, Ai < • • • < A„ and their corresponding eigenvec- 
tors |0i), . . . , |0„). A portfolio such that the vector of 
weights \w) has zero overlap with the first n eigenvectors 
of C has a risk that is bounded from above by Xn+i- The 
problem with this idea is that it relies on the assumption 
that the covariance matrix C is perfectly known and con- 
stant in time. The observation of a sufficiently long time 
series of past returns would allow one, in such a stable 
world, to determine C and to immunize the portfolio 
against risky investment modes. Unfortunately, this idea 
is thwarted by two (inter-related) predicaments: a) time 
series are always of finite length, and lead to substantial 
"noise" in empirical estimates of C [F and b) the world is 
clearly not stationary and there is no guarantee that the 
covariance matrix corresponding to the pre-crisis period 
2000-2007 is the same as the one corresponding to the 
period 2008-2011. For one thing, some companies disap- 
pear and others are created in the course of time. But 
even restricted to companies that exist throughout the 
whole period, it is by no means granted that the correla- 
tion between stock returns do not evolve in time. This is 
why it is common practice in the financial industry to re- 
strict the period used to determine the covariance matrix 
to windows of a few years into the recent past. This leads 
to the measurement noise problem alluded above. Now, 
if the "future" large eigenvectors do not coincide with the 
past ones, a supposedly low risk portfolio will in fact be 
exposed to large risks directions in the future. Denoting 
as 10°) the past eigenvectors and the future ones, 
the "unexpected risk" of portfolio can be defined as 
J2^=i ^j{'t']\<l>^y ■ Therefore, as for the quantum dissipa- 
tion problem, the statistics of the overlaps Gij = 



2 



is a crucial piece of information. 

When trying to follow the evolution of a given eigen- 
vector as the perturbation is increased, one imme- 
diately faces a problem when eigenvalues "collide". It 
is well known that true collisions (degeneracies) are non 
generic; the collisions are in fact avoided and levels do 
not cross. However, upon the pseudo-collision of \i and 
Ai+i (say), the eigenvectors \4'i) and |</ii+i) strongly hy- 
bridize. Therefore, single eigenvectors are strongly un- 
stable in time. The idea is then to study the stability 
of the subspace spanned by 2p + 1 several consecutive 

eigenvalues: [\4>l.p): ■ ■ ■ \<t>l) , \'f>l+p)}- Motivated by 
the above examples, we ask the following question: how 
should one choose q> p such that the subspace spanned 
by the set ||(/>fc_g), • . . {(jjl), . • . , has a significant 

overlap with the initial subspace? In order to answer this 
question, we consider the {2q + 1) x {2p + 1) rectangu- 
lar matrix of overlaps G with entries Gij. The {2p + 1) 
non zero singular values 1 > Si > S2 ■ ■ ■ S2p+i > of 
G give full information about the overlap between the 
two spaces. For example, the largest singular value si 
indicates that there is a certain linear combination of the 
{2q + 1) perturbed eigenvectors that has a scalar prod- 
uct si with a certain linear combination of the {2p + 1) 
unperturbed eigenvectors. If S2p+i = Ij then the initial 
subspace is entirely spanned by the perturbed subspace. 
If on the contrary si ^ 1, it means that the initial and 
perturbed eigenspace are nearly orthogonal to one an- 
other. A good measure of the distance D between the two 
spaces is provided by — (In s) = — (X)i 1^ 1) (but 

alternative measures, such as 1 — (s) , could be considered 
as well). Since the singular values s are obtained as the 
square- root of the eigenvalues of the matrix G^G, one 
has D{p,q) = - Indet G^G/2P, where we introduce for 
convenience the notations P = 2p+l, Q = 2q + l. When 
two subspaces of dimensions P and Q are constructed us- 
ing randomly chosen vectors in a space of dimension N, 
one expects accidental overlaps, such that D{p,q) is in 
fact finite. The distance can be calculated exactly using 
Random Matrix Theory tools in the limit N, P,Q ^ oo, 
with a = P/N and (3 ^ Q/N held fixed. The resuh is 



tial eigenvectors as: 
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Ai — Xf Xi — Xi 



where Pij = 



The denominators — Xj remind 



us that eigenvectors are strongly affected by eigenvalue 
pseudo-collisions, as alluded to above. The above expres- 
sion allows one to obtain the overlap matrix G. Keeping 
only the relevant terms to order e^, one finds: 



if i =j, 
if i^j. 
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It is then easy to derive the central result of our study: 
to second order in e, the distance D{p, q) between the 
initial and perturbed eigenspaces is: 
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We now turn to two explicit illustrations, first in the 
context of the GOE matrices, and then in the context of 
empirical correlation matrices. 

Eigenvector stability in the GOE ensemble. We now 
choose Hq and P to be two independent realizations of 
the Gaussian Orthogonal Ensemble of random matrices 
of size Nx N. Wc normalize the elements of Hq and P to 
have variance 1/iV, such that the density of eigenvalues 
p{X) tends to the Wigner semi-circle when N ^ oo. We 
consider the subspace of initial eigenvectors correspond- 
ing to all the eigenvalues A contained in a certain finite 
interval [a, b] included in the Wigner sea [—2, 2]. We want 
to compute the distance D between this subspace and 
the subspace spanned by the pertubed eigenvectors cor- 
responding to all eigenvalues contained in [a — S,b + S]. 
Using the above formula, we find that in the N oo 
limit, D tends to a finite limit as soon as 5 > 0: 
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where 7± = Q;-h/3-2a/3±2^a^(l - a)(l - /3). In other 
words, in that limit, the full density of singular values is 
known. This provides a benchmark to test whether the 
two eigenspaces are accidentally close {D w Drmt), or 
if they are genuinely similar {D <^ Drmt)- 

Endowed with the above formalism, we can now pro- 
ceed to compute D in the case where the perturbation 
is small. Using standard perturbation theory, the per- 
turbed eigenvectors can be expressed in terms of the ini- 



L>(a,6; 5) 
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We checked this formula using numerical simulations, 
with very good agreement for different values of a, b and 
(5, and for e up to O.lj^ It is interesting to study the above 



^ One can be much more precise and compute the full distribution 
of all singular values, giving an indication of their scatter around 
the mean position (s) [Tj- 
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expression in the double limit 5—^0 and A = b — a —> 0. 
One finds: 

D{a,a + A;S) 



2 P(a) ln(A/^) 
^ A 
,2 P(a) 



if (5< A< 1, 
if 1 >(5 > A. 



(4) 



This last expression shows that when the width A of in- 
terval [a, 6] tends to zero, the corresponding eigenvectors 
are scattered in a region of width b much larger than A 
itself as soon as e ^ -s/A- It also shows that for fixed 
A, the distance D diverges logarithmically when 5 — )■ 0. 
This is a consequence of the pseudo-collisions that oc- 
cur between eigenvalues close to the boundaries of the 
interval [a, 6]. When i5 > 0, these pseudo-collisions are 
avoided and D remains finite. When (5 = 0, a more pre- 
cise analysis is needed. One can show that in the limit 
N ^ oo, the following result holds [^: 

D(a,6;(5 = 0)«ln7Ve2 pH^+pM" ^) 

2/>(A)dA 

where A{a, b) is a constant that can be explicitely com- 
puted, and involves the well known two-point function 
g{r) that describes the level-level correlations in the 
GOE. The In TV term can be guessed from the logarithmic 
behaviour of D when 6 0, since one indeed expects the 
divergence to be cut-off when i5 becomes of the order of 
the level spacing, i.e. 6 ~ {Np)~^. As a side remark, we 
note that Eq. Q predicts that a fraction (x of the 
original eigenspace gets shoved away at distances larger 
than S (in eigenvalue space). In the context of the non 
adiabatic evolution of a quantum system [J] , this implies 
that the energy of the system makes jump with a power- 
law distribution of sizes, such that all moments of order 
q > 1 diverge. This means that under an extreme non- 
adiabatic process, the energy is not diffusive but rather 
performs a "Cauchy flight" (i.e. a Levy flight with a tail 
exponent equal to 2). When the perturbation varies on 
a flnite time r, one expects the tails of the jump process 
to be truncated beyond (5 ~ fi/r [4]. 

Eigenvector stability for covariance matrices. As men- 
tionned in the introduction, covariance matrices are cru- 
cial to many problems in theoretical and applied finance. 
Empirically, the conundrum is the following: on the one 
hand, one needs sufficiently long time series in order to 
reduce the statistical noise in the determination of these 
matrices, but on the other hand, one a priori expects that 
covariance matrices do evolve with time. If the measure- 
ment time is too short, the empirical covariance matrix 
will appear to evolve with time, but this may just be 
due to the measurement noise that is not reproducible 
from one period to the next. If the measurement time is 
too long, one may miss important correlation shifts and 
get exposed to unwanted sources of risk. The theory we 
develop here provides a precise estimate of the amount 
of eigenspace instability induced by measurement noise. 



Any extra dynamics of the eigenvectors is therefore at- 
tributable to a genuine market evolution. 

Let {''i(i)}i<i<jv i<t<T ^'^^ '^^ returns (for ex- 

ample on a daily time scale) of N assets over a period 
of length T, which can be seen as a iV x T random rec- 
tagular matrix, Rpf. Let us assume that these returns 
are stationary gaussian variables with zero mean and a 
(true) covariance matrix C. The empirical (or "sam- 
ple") covariance matrice is defined as: E := ^R^j^Rff. 
When T — > oo, E — > C since we assume for now that 
the underlying return process is stationary. For finite T 
however, one has [8]: 



E 



1 

C + £, with Stj^T^Y. (*) '^J (0 - ■ (6) 



Thus the sample covariance matrix E is a perturbed ver- 
sion of C. Each of the £ij has mean and the covari- 
ance structure of the entries of £ is given by (SijSki) = 
(CikCji + CuCjk)/T. Using the same framework as 
above, one can calculate the distance (or overlap) be- 
tween the top P eigenvectors of the true correlation ma- 
trix C and the top Q eigenvector of the empirical corre- 
lation matrix E. [We focus on the top eigenvalues and 
eigenvectors because these represent the most risky direc- 
tions in a financial context.] Provided T is large enough 
for the above perturbation theory to be valid, and upon 
averaging over the measurement noise: 
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where the A^s are the eigenvalues of C, in decreasing or- 
der. One can similarly define the distance between the 
eigenspaces of two independant sample covariance matri- 
ces E'' and E* (determined on two non overlapping time 
periods), with: 



1 ^ 

v = l 



= (t,s), \t-s\>T 
(8) 

In this case, the above formula Eq. ([t]) is simply multi- 
plied by a factor 2. In practice of course one does not 
know the true matrix C, and whether it evolves in time or 
not. If C was time independent, the measurement noise 
on its eigenvalues should be such that, for all i: 



{xt 



\t-s\>T 



4A? 
T ■ 



(9) 



where the Xi are the eigenvalues of the matrix C mea- 
sured empirically using the whole period of time and 
where (•)|t_s|>j' denotes an empirical average over all 
s,t such that |i — s| > T. We determined the empirical 



variograms ((Af — A,-) ) 



\t's\-- 



for i — 1,2, using daily 



returns of iV = 204 stocks of the Nikkei index in the pe- 
riod 2000 - 2010. The resuh is shown in Fig. [l] which 
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clearly shows that there is a genuine evolution of the 
eigenvalues of C with time. For the top eigenvalue, this 
is a well known effect [8]: both the volatility of individ- 
ual stocks and the average correlation between stocks are 
time dependent. We see that the same is true for smaller 
eigenvalues too, reflecting the instability of intra-sector 
correlations. 

What about the eigenvectors ? One could be in a situa- 
tion where the eigenvectors keep a fixed direction through 
time while eigenvalues are moving around. But if the 
eigenvalues themselves are evolving with time, the above 
formulas need to be upgraded. Let us assume that the 
true covariance matrix Ct has time dependent eigenval- 
ues , . . . , but with constant eigenvectors that will 



be denoted 



, \(j)N) as above. For times s < t 



with |t — s| > T, we define the overlap matrix G*' as: 
G^J* = (0f|0*). Under the assumption that the eigen- 
values are varying sufficiently slowly with time, one now 
finds that: 



D{P,Q;s,t) 




(10) 



Up to corrections of order T"'^/^, one can replace in the 
above formulas the A^'* by their empirical estimates. We 
finally compute the theoretical distance Dth{P,Q,T) as 
an average over all s,t such that \t — s\ = t oi the above 
quantity. 

We now compare our null hypothesis formula, Eq. ( 10 ) 
with (a) an empirical determination of Demp 



,{P,Q,t) us- 
ing financial data and (b) a numerical determination of 
Dnum{P, Q, t) using synthetic time series of returns that 
abide to the hypothesis of a covariance matrix Ct with 
fixed eigenvectors, but time dependent eigenvalues. We 
choose an arbitrary (but fixed) set of orthonormal vectors 

■ ■ ,\i^N) and define Ct as Ct = Ejli •^ilV'»)(V'zl, 
where the A* are the empirical eigenvalues obtained on 
the financial return time series. We then use Ct to gener- 
ate synthetic gaussian multivariate returns {ri{u)}. We 
show the corresponding results in Fig. [2] with the choice 
P = 5, Q = 10, as a function of r and for T = 204 days. 



We conclude that (i) the theoretical formula Eq. ( 10 1 is 



indeed in very good agreement with the numerical results 
obtained with synthetic data: Dnum ~ Dth] whereas (ii) 
the financial data clearly departs from the null hypoth- 
esis of constant eigenvectors, since Demp > Dth- The 
same conclusion holds for different values of P, Q. We 
also show the ratio Demp/ Dth for r = T, as a func- 
tion of T. This plot reveals a marked maximum around 
T* « 2 years, suggesting that the correlation matrix has 
some true dynamical evolution with a mean reversion 
time around T*, with important consequences for risk 
analysis. 

As a conclusion, we have developed general tools to de- 




FIG. 1. Empirical variograms ((A^ — Xl)^)\t-s\=T for i — 1 
(left) and i = 2 (right). The horizontal dotted lines corre- 
spond to the theoretical prediction for a time independent 
correlation matrix. 





FIG. 2. Left: Dth, Dnum, Demp as a function of r with P = 
5, Q = 10 and T = 204 for iV = 204 stocks of the Nikkei index 
in the period 2000-2010. Note that Demp reaches a maximum 
for T « 500, which is significantly smaller than the random 
benchmark Drmt ~ 0.83 for P = 5, Q = 10, = 204. Right: 
The ratio Demp/ Dth as a function of T, for t — T, P = 5,Q = 
10, revealing the same maximum for T* ~ 500. 



scribe the dynamics of eigenvectors under the influence of 
random perturbations. We obtained exact results in the 
context of random matrices in the GOE ensemble and 
correlation matrices, with in mind applications to finan- 
cial risk, but hopefully the ideas and methods introduced 
here can be used in a much broader context. 
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